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This  report  introduces  a  plan  for  the  development  of  a  theoretically  based  computational 
scheme  of  natural  language  generation  for  a  translation  system.  The  emphasis  of  the  project 
is  the  mapping  from  the  lexical  conceptual  structure  of  sentences  to  an  underlying  or  “base” 
syntactic  structure  called  deep  structure.  This  approach  tackles  the  problems  of  thematic 
and  structural  divergence,  i.e.,  it  allows  generation  of  target  language  sentences  that  are 
not  thematically  or  structurally  equivalent  to  their  conceptually  equivalent  source  language 
counterparts.  If  the  endeavor  succeeds,  knowledge- based  inferencing  will  not  be  necessary, 
and  lexica]  selection  and  syntactic  realization  will  be  facilitated. 
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ABSTRACT:  Current  approaches  to  generation  for  machine  translation  make  use  of  direct- 
replacement  templates,  large  grammars,  and  knowledge- based  inferencing  techniques.  Not 
only  are  rules  language- specific,  but  they  are  too  simplistic  to  handle  sentences  that  exhibit 
more  complex  phenomena.  Furthermore,  these  systems  are  not  easily  extendable  to  other 
languages  because  the  rules  that  map  the  internal  representation  to  the  surface  form  are  en¬ 
tirely  dependent  on  both  the  domain  of  the  system  and  the  language  being  generated.  Finally 
an  adequate  interlingual  representation  has  not  yet  been  discovered;  thus,  knowledge-based 
inferencing  is  necessary  and  syntactic  cross-linguistic  generalization  cannot  be  exploited. 

This  report  introduces  a  plan  for  the  development  of  a  theoretically  based  computational 
scheme  of  natural  language  generation  for  a  translation  system.  The  emphasis  of  the  project 
is  the  mapping  from  the  lexical  conceptual  structure  of  sentences  to  an  underlying  or^base”^ 
syntactic  structure  called  deep  structure.  This  approach  tackles  the  problems  of  thematic 
and  structural  divergence,  i.e.,  it  allows  generation  of  target  language  sentences  that  are 
not  thematically  or  structurally  equivalent  to  their  conceptually  equivalent  source  language 
counterparts.  Two  other  more  secondary  tasks,  construction  of  a  dictionary  and  mapping 
from  deep  structure  to  surface  structure,  will  also  be  discussed. 

The  generator  operates  on  a  constrained  grammatical  theory  rather  than  on  a  set  of 
surface  level  transformations.Jf  the  endeavor  succeeds,  there  will  no  longer  be  a  need  for 
large,  detailed  grammars;  generad  knowledge- based  inferencing  will  not  be  necessary;  lexical 
selection  and  syntactic  realization  will  be  facilitated;  and  the  model  will  be  general  enough 
for  extension  to  other  languages.  \ _  ^  ^  j 
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1  Introduction 


This  report  introduces  a  new  scheme  for  natural  language  generation  based  on  lexical  concep¬ 
tual  structure,  which  represents  meaning  through  predicate  decomposition.1  For  example, 
the  word  coptine  would  be  represented  as:2 

(event  CAUSE  (thing  X)  (poss  BE  (thing  X)  (thing  Y))  (property  FORCEFULLY)) 

In  other  words,  capture  is  viewed  as  an  event  in  which  an  agent  (X)  forcefully  causes  a  theme 
(Y)  to  be  possessed  by  the  agent. 

The  goal  of  the  project  is  to  produce  a  language-independent  system  suitable  for  a  gen¬ 
eration  component  of  a  machine  translator.  Lexical  conceptual  structure  is  used  to  ease 
the  complicated  operations  associated  with  generation,  lexical  selection,  and  syntactic  re¬ 
alization.  In  particular,  these  operations  are  difficult  when  semantically  equivalent  source 
and  target  language  verbs  are  not  thematically  or  structurally  equivalent.  This  situation  is 
usually  apparent  when  there  is  a  choice  between  two  or  more  target  language  translations. 
For  example,  the  English  word  slash  might  be  translated  as  the  Spanish  word  cortar  (liter¬ 
ally,  to  cut),  or  the  composite  Spanish  form  dar  cuchilladas  a  (literally,  give  knife-wounds 
to).  The  correct  lexical  selection  and  syntactic  realization  of  the  surface  form  in  such  cases 
is  based  on  a  systematic  mapping  between  the  lexical  conceptual  forms  of  the  source  and 
target  languages.  This  will  be  discussed  in  more  detail  in  section  4.3. 

Previously,  generation  systems  did  not  provide  a  representation  of  “meaning”  for  the 
verbs  being  generated;  rather,  language  dependent  templates,  inferencing  procedures,  and 
network  searching  rules  were  used  to  select  the  target  language  verb.  Such  systems  did 
not  take  predicate- argument  structures  into  account;  thus,  they  could  not  explain  thematic 
or  structural  divergence.  Furthermore,  cross-linguistic  generalization  was  ignored  since  the 
templates  and  networks  were  specifically  tailored  to  the  languages  handled  by  the  system. 
The  approach  described  here  does  not  make  use  of  language-dependent  devices  found  in  older 
systems.  Instead,  verbs  are  defined  in  terms  of  many  semantic  components  that  contribute 
to  the  overall  meaning;  these  composite  structures  can  then  be  mapped  cross-linguistically 
in  order  to  arrive  at  target  language  forms. 

The  next  five  sections  describe  the  generation  system.  The  second  section  provides  the 
background  for  natural  language  generation  in  machine  translation.  First,  a  brief  description 
of  the  theory  behind  the  generation  scheme  is  given.  Then  existing  generation  schemes  will 
be  discussed  and  their  shortcomings  will  be  addressed. 

A  plan  for  the  development  of  a  theoretically  based  computational  scheme  will  be  in¬ 
troduced  in  the  third  section.  Three  components  of  the  system  will  be  identified:  (1)  the 
dictionary;  (2)  the  morphological/syntactic  synthesizer;  and  (3)  the  module  that  maps  lex¬ 
ical  conceptual  structure  to  deep  structure.  The  third  component  is  the  emphasis  of  this 
discussion. 

'The  representation  adopted  here  is  as  formulated  by  Hale  and  Laughren  (1983),  and  Hale  and  Keyser 
(1986). 

2The  modifiers  posit,  poss  and  ident  stand  for  positional,  possessional  and  identificational  respectively. 
Words  in  upper-case  are  the  primitive  units  of  meaning. 
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How  the  scheme  embodies  linguist i»-  1  henry  will  be  explained  in  the  fourth  section.  Kx- 
atnples  of  problems  that  might  be  encountered  during  generation  of  English  and  Spanish  will 
be  presented.  Finally,  the  goals  of  the  scheme  will  be  described. 

The  filth  section  presents  a.  description  of  the  work  that  needs  to  be  done  in  order  to 
accommodate  the  scheme.  The  generator  will  replace  the  generation  component  that  is 
currently  part  of  the  I  NI  I  RAN  machine  translation  system.*  The  new  generator  will  operate 
on  a  constrained  grammatical  t  heory  rather  than  on  a  set  of  surface  level  transformations. 
The  basic  building  blocks  of  the  system  will  be  discussed.  Also,  methods  of  testing  and 
evaluating  the  system  will  be  presented. 

In  the  sixth  section,  some  of  the  difficulties  that  might  arise  in  the  development  of  the 
scheme  are  addressed. 

2  Background  for  Research 

This  section  introduces  the  background  for  a  generation  scheme  based  on  lexical  conceptual 
structure,  and  provides  a  description  of  three  other  commonly  used  generation  designs:  (1) 
direct-replacement,  (2)  syntactic-based  generation  and  (3)  semantic-based  generation.  The 
advantages  and  disadvantages  of  these  three  designs  will  be  discussed.  Finally,  it  will  present 
arguments  for  why  a  design  based  on  lexical  conceptual  structure  is  an  improvement  over 
other  designs. 

2.1  Lexical  Conceptual  Structure  Approach  to  Generation 

The  work  of  Jackendoff  ( 1972)  has  influenced  much  of  the  lexical-semantic  work  of  the  Lex¬ 
icon  project  at  MIT.  The  representation  adopted  is  lexical  conceptual  structure  (henceforth 
LOS).  According  to  Rappaport.  and  Levin  (1986),  LOS  encodes  a  verb’s  meaning  through 
predicate  decomposition.  For  example,  the  LOS  for  the  word  put  is: 

(event  (posit  HOVE  (thing  X)  (thing  Y)  (place  Z))) 

Linking  rules  relate  variables  in  the  I.CS  to  the  variables  in  the  predicate-argument  structures , 
which  provide  an  explicit  representation  of  hierarchical  relations  between  the  verb  and  its 
arguments.  For  example,  the  predicate-argument  structure  for  put  is:3 4 

X  <y  P-loc  z> 

The  linking  rules  that  relate  the  LOS  to  the  predicate  argument  structure  associate  thematic 
roles  (henceforth  0-roles)  like  agent,  theme,  and  recipient  with  variables.  An  example  of  such 
a  linking  rule  is: 

3 See  Dorr  (1987). 

4This  form  of  the  predicate-argument  structure  is  taken  from  Rappaport  and  Levin  ( 1986).  The  variables 
outside  the  brackets  are  external  arguments,  and  the  variables  inside  the  brackets  are  internal  arguments. 
Henceforth,  I  will  be  representing  such  structures  as  annotated  bracketed  forms  that  correspond  directly  to 
tree  structures;  in  this  form  external  arguments  correspond  to  positions  outside  the  maximal  projection  of 
the  verb,  and  internal  arguments  correspond  to  positions  inside  the  verb’s  maximal  projection. 


Link  the  agent  role  with  the  external  argument  variable  in  the 
predicate-argument  structure. 


Hie  verb  is  then  stored  in  the  lexieon  with  its  IX'S  and  the  0-roles  it  assigns  to  the  variables 
of  t  he  IX'S: 


PUT:  (event  (posit  MOVE  (thing  X)  (thing  Y)  (place  Z))) 

X  =  agent,  Y  =  theme,  Z  =  locatum 

The  relations  between  the  verb  and  its  arguments  are  then  manifested  as  grammatical  func¬ 
tions  in  the  syntactic  underlying  form  of  the  sentence.  The  following  illustrates  an  underlying 
form  containing  the  verb  put: 

(1)  is  [np  I)  [vp  put  [NP  the  bookj  Ipp  >»  the  box]]] 

2.2  Early  Generation  Designs:  Direct  Replacement 

Several  generation  systems  have  used  a  direct  replacement  scheme  (see  Brown  (1974),  Forbus 
and  Stevens  (1981),  Swartout  (1981),  and  Winograd  (1982)).  Essentially,  the  technique 
involves  templates  that  map  an  internal  representation  into  surface  text.  As  an  example,  we 
will  look  at  the  generation  of  text  from  internal  concepts  as  found  in  Swartout’s  XPLAIN 
system  (1981 ). 

The  XPLAIN  phrase  generator  maps  the  concepts  to  phrases.  For  example,  the  concept: 
((pvcs*f  dangerous) *f  (induced*o  (by*o  digitalis))) 


is  mapped  to  the  phrase: 


dangerous  pvcs  induced  by  digitalis 

In  order  for  this  mapping  to  take  place,  a  set  of  templates  are  used.  The  tie  of  each 
concept  (indicated  by  a  letter  preceded  by  an  asterisk  (*))  points  to  the  template  that 
produces  text  for  that  concept.  In  the  concept  above,  *f  indicates  that  the  second  element 
in  the  list  is  a  modifier  of  the  first  element;  and  *o  indicates  that  the  second  element  is  the 
object  of  the  first  element.  The  template  associated  with  *f  places  the  second  element  (the 
modifier)  before  the  first  element  if  it  is  a  single  word  or  adjective;  otherwise,  the  modifier 
is  placed  after  the  first  element.  Thus,  (pvcs*f  dangerous)  is  mapped  to  dangerous  pvcs , 
whereas  (block*f  (on*o  the  table))  is  mapped  to  block  on  the  table.  The  template 
associated  with  *o  places  the  second  element  (the  object)  after  the  first  element.  Thus, 
(by*o  digitalis)  is  mapped  to  by  digitalis. 

The  advantage  to  using  a  direct  replacement  scheme  is  that  expressions  that  are  part 
of  the  (domain-dependent.)  internal  representation  for  concepts  can  be  mapped  directly  to 
surface  text  without  the  need  of  an  underlying  linguistic  representation  of  the  surface  form. 
However,  the  disadvantages  of  the  approach  greatly  outweigh  this  advantage.  First,  gram¬ 
matical  relations  are  identified  by  means  of  ad  hoc  rules  that  are  implicit  in  the  templates; 
not  only  are  these  rules  language-specific,  but  they  are  too  simplistic  to  handle  sentences  that 
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exhibit  more  complex  phenomena  (like  raising  and  embedded  sentences ).  Second.  I  he  system 
is  not  easily  extendable,  nor  is  its  design  readily  I ransparent .  because  the  rules  mapping  the 
internal  representation  to  the  surface  form  are  entirely  dependent  on  both  the  domain  of  the 
system  and  the  language  being  generated.  ' 


2.3  Syntactic  Approach  to  Generation 


:<.f  t 

VA 

M' .■ 

y*V 


As  Chomsky’s  transformational  paradigm  quickly  gained  popularity  in  the  I96(l’s,  machine 
translation  systems  began  to  take  a  phrase  structure  approach  to  both  parsing  and  gener¬ 
ation.  However,  these  systems  were  not  based  on  a  theory  of  unirrrsal  grammar  as  is  part 
of  Chomsky’s  Government- Binding  (henceforth  CB)  theory  (see  Chomsky  ( 1 98 1 ) )-  Rather 
than  taking  an  approach  that  was  oriented  toward  a  syntactic  interlingua  ( i.e ..  language- 
independent  form)  based  on  deep  structures,  these  systems  used  large  language-specific  gram¬ 
mars  (o  parse  and  generate  the  source  and  target  languages. 

An  example  of  a  rule- based  syntactic  system  is  the  generator  of  the  METAL  translation 
system  (see  Slocum  (1081,  1985)),  which  is  currently  equipped  with  approximately  600  rules 
and  10,000  lexical  entries  in  each  of  the  two  main  languages  (German  and  English).  Bennett 
and  Slocum  (1985)  argue  that  the  transfer  translation  design  (i.e.,  the  mapping  of  “shallow 
analyses  of  sentences”  into  “shallow  analyses  of  equivalent  sentences”)  is  adequate  for  near- 
term  applications.  The  argument  against  employing  a  “deep  representation”  is  that  long¬ 
term  trials  of  such  approaches  seem  to  indicate  that,  a  suitable  “deep  representation”  is  not 
possible;  furthermore,  systems  Ilia)  use  a  “deep  representation”  cannot  handle  unrestricted 
input  (some  of  which  is  ungrammatical). 

Although  a  shallow  analysis-synthesis  of  sentences  might  avoid  some  problems  associated 
with  current  interlingual  translation  approaches,  the  complexity  and  language-specific  nat  ure 
of  the  rules  t  ranslate  into  several  problems.  First  of  all,  because  the  rules  and  lexical  entries 
are  so  complex,  the  subject  area  must  be  very  limited.  Secondly,  each  rule  is  highly  language- 
dependent  in  character;  thus,  there  must  be  a  set  of  target -specific  transfer  rules  for  every 
language  that  will  serve  as  a  target.  This  means  that  the  rule  system  grows  rapidly  as  each 
target  language  is  added  to  t  he  system.  Thirdly,  the  rules  are  very  stipulatory;  there  are  no 
theoretical  reasons  lor  the  rules  being  the  way  they  are.  Finally,  each  rule  must  carefully 
spell  out  the  details  of  its  application;  thus,  there  is  no  way  to  capture  linguistic  generality 
among  the  rules  in  the  system  since  general  constraints  are  not.  factored  out  of  the  syntactic 
rules. 

Two  other  systems  that  take  a  syntactic  approach  to  generation  are  the  TEXT  system 

5 Adinitt<*<llv.  template-systems  arc  generally  not  geared  toward  discovering  or  implementing  a  linguis¬ 
tic  theory.  Swartout  acknowledges  that  his  generator  consists  of  the  hare-minimum  required  to  produce 
acceptable  output;  thus,  linguistic  principles  are  ignored: 

The  generator  should  really  he  viewed  more  as  an  engineering  effort  that  attempts  to  produce 
acceptahle  English  rather  than  as  a  generation  system  that  encodes  deep  linguistic  principles. 

The  main  thrust  of  this  thesis  has  been  to  investigate  ways  of  representing  the  knowledge 
necessary  to  justify  expert  consulting  systems.  A  generator  is  necessary  to  demonstrate  the 
capabilities  of  the  approach  being  espoused  here,  but  the  generator  itself  has  not  been  the  focus 
of  the  research. 
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(define- stylistic  rule  PREFER-ADJECTIVES-TO-NEtf-SENTENCE 
ordering-on-attachment-points 

(at tach-as-adj active  at tach-as -new -sentence) 
applicability-condition 

(if  (includes-attachment -point  ’attach-as-adjective 
usable-attachment -points) 

(not  (or  (will-be-complex-adjective-phrase 

(usable-choices  ’ attach-as-adjective) ) 

(too-heavy-with-adj actives 
(np-being-attached-to  ’attach-as-adjective))) ) ) ) 

Figure  1:  Stylistic  Rule  Used  for  Adjectival  Attachment  in  MUMBLE 

(McKeown  (1983,  1985))  and  the  MUMBLE  system  (McDonald  (1983,  1987)).  These  two 
systems  are  similar  in  that  they  use  discourse  and  focus  constraints  to  derive  messages 
(i.e.,  underlying  representational  forms)  that  are  then  used  to  generate  syntactic  structures 
corresponding  to  the  surface  text.  Generation  of  syntactic  structures  in  TEXT  is  based  on 
the  use  of  discrimination  networks  (to  be  described  in  section  4.3)  and  functional  unification 
grammars  (see  Kay  ( 1984)).  Generation  of  syntactic  structures  in  MUMBLE  is  based  on  the 
use  of  tree-adjoining  grammars  (see  McDonald  and  Pustejovsky  ( 1985b))  and  stylistic  rules 
(see  McDonald  and  Pustejovsky  (  1985a)).  Although  both  of  these  systems  move  away  from 
the  rule-based  approaches  of  earlier  schemes,  they  do  not  take  advantage  of  structural  and 
lexical  generalization  across  languages.  For  example,  the  stylistic  rules  used  for  syntactic 
realization  in  MUMBLE  are  hand-generated;  not  only  are  they  specific  to  English,  but  they 
are  also  specific  to  the  domain  of  the  system.  Furthermore,  they  are  often  tedious  to  write, 
and  their  function  in  the  system  is  generally  not  readily  transparent.  Figure  1  shows  an 
example  of  such  a  stylistic  rule.  This  rule  is  used  for  attachment  of  an  adjective  to  a  noun- 
phrase  during  the  generation  process. 

In  general,  the  move  away  from  rule-based  syntactic  generation  systems  is  a  step  in  the 
right  direction.  However,  care  must  be  taken  to  prevent  language-dependent  devices  from 
showing  up  in  other  forms.  Language-independent  universals  need  to  be  dealt  with  in  a 
systematic  way  rather  than  in  an  ad  hoe  manner;  language-specific  idiosyncrasies  can  then 
be  handled  by  a  smaller  set  of  individually  applicable  routines. 

2.4  Semantic  Approach  to  Generation 

At  the  other  end  of  the  spectrum  of  generation  systems  are  those  systems  which  largely 
reject  syntax  as  a  basis  of  generation  for  language  translation.  Rather,  generation  is  treated 
almost  entirely  on  the  basis  of  semantics,  guided  by  a  strong  underlying  model  of  the  current 
situational  context  and  expectations.  (See  Lytinen  and  Schank  (1982),  Lytinen  (1985,  1987), 
Carbonell  (1981),  Cullingford  (1986),  Nirenburg  el.  al.  (1985,  1986,  1987).) 

The  semantic-based  (also  called  knowledge-based  systems)  are  generally  interlingual. 
That  is,  they  employ  a  conceptual  representation  that  is  independent  of  any  natural  lan- 
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guage.  Generally,  this  iiiterlingua  can  In*  encoded  by  means  of  primilin  meaning  units,  l  or 
example,  in  the  MORTRANS  system  (l.ytinen  ami  Sehnnk.  198*2),  the  Spanish  word  <<//<- 
turar  is  defined  as  GRT-(’()NTR()L  in  the  dictionary.  A  ciali'.ation  routine  deterinines 
that  caphmir  (  capture)  is  t « >  he  generated  as  the  word  urn  si  in  the  target  language  if  I  In¬ 
correct  context  ((.</.,  a  police  search)  has  been  instantiated. 

Several  arguments  for  choosing  a  semant ic- based  design  over  a  syntactic-based  design  lor 
generation  systems  have  prevailed.  The  first  is  that  the  number  of  rules  in  a  syntactic-based 
system  would  be  enormous:  a  word  may  have  several  word  senses,  and  each  word  sense  would 
require  a  myriad  of  rules  specifying  the  contexts  in  which  the  word  sense  might  appear. 

A  second  related  problem  is  indexing.  Since  there  are  thousands  of  rules  to  choose  from, 
“the  amount  of  information  the  system  would  have  to  look  for  would  be  enormous,  and 
deciding  what  information  in  the  sentence  was  relevant  for  disambiguating  the  word  in  each 
particular  context  would  be  impossible.'”’ 

The  third  argument  for  a  semantic-based  design  is  that  syntactic-based  approaches  tend 
to  be  overly  concerned  with  the  form  of  the  input  rather  than  the  content  (see  C’ullingford, 
1986).  Consequently,  these  grammar-based  approaches  do  not  easily  handle  deviant  input 
( e.g .,  input  that  is  ungrammatical). 

The  claim  that  rule-based  syntactic  systems  are  both  too  large  and  too  complex  to 
adequately  handle  natural  language  translation  may  be  well-grounded,  but  the  semantic- 
based  approach  does  not  combat  the  problem!  In  attempting  to  tackle  the  problem  of 
word  disambiguation,  semantic-based  systems  incorporate  an  incredibly  massive  amount  of 
knowledge,. effectively  limiting  t  he  domain  of  subject  mat  ter. 

An  additional  draw-back  to  semantic- based  approaches  is  that,  there  is  a  loss  of  structure 
and  style  in  generating  the  target  text  from  the  underlying  (interlingual)  form;  consequently, 
the  output  of  these  systems  is  a  paraphrase,  not  a  translation.  Although  the  deep  contextual 
meaning  of  the  input  text  is  preserved,  the  emphasis  or  intent  of  the  text  is  not  always  fully 
preserved.  The  claim  is  that  any  other  system  which  attempts  to  preserve  structure  and  style 
without  the  knowledge  necessary  for  text  understanding  would  often  produce  unreliable 
translations.  However,  the  loss  of  structure  and  style  may  involve  a  loss  of  some  of  the 
meaning  of  the  text.  Most  likely,  the  speaker  chooses  a  particular  structural  realization 
in  order  to  focus  on  a  specific  topic  or  to  make  a  crucial  point;  the  absence  of  structure 
preservation  might  result  in  a  complete  misinterpretation  of  the  text. 

Finally,  another  problem  with  know  ledge- based  generation  systems  is  that,  they  typically 
require  an  involved  general  inference  mechanism  in  order  to  arrive  at  the  surface  form  for  a 
primitive  concept.  Rather  than  basing  word  selection  on  general  lexical  principles,  complex 
inferencing  routines  are  applied  to  conceptual  representations.  (Some  examples  of  the  type 
of  inferencing  that  is  required  for  lexical  selection  will  be  shown  in  section  4.3.) 

2.5  The  Shift  Toward  an  LCS  Generation  Approach 

The  rule  systems  for  existing  natural  language  generators  are  still  large,  detailed,  and  com¬ 
plicated.  Furthermore,  generation  systems  lack  linguistic  motivation  for  the  rules  that  they 
do  have.  The  two  primary  tasks  of  natural  language  generation,  lexical  selection  and  syn- 
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tactic  nalization,  art*  not  dealt  with  in  a  systematic  manner;  rather,  «</  hoc  procedures  are 
applied  to  underlying  representations  to  arrive  at  surface  structures. 

If  the  basis  of  generation  designs  is  shifted  from  complex,  language-specific  rules  systems 
to  modular  syntactic  theories  that  empb>\  a  well-defined  lexical  conceptual  representation, 
several  of  the  problems  associated  with  earlier  theories  will  be  solved,  Grammars  will  no 
longer  be  huge  and  complicated;  small  sets  of  lexical-semantic  principles  will  replace  compli¬ 
cated  non-explanatory  generation  routines;  and  general  inferencing  will  no  longer  be  neces¬ 
sary.  The  next  section  describes  the  steps  involved  in  constructing  a  generator  on  the  basis 
of  LCS. 

3  Generation  Scheme 

Implicit  in  the  generator  are  three  components.  The  first  two  components  are  not  the  em¬ 
phasis  of  this  project,  but  they  are  nonetheless  necessary.  The  first  is  a  dictionary  containing 
lexical  conceptual  representations  that  serve  as  the  basis  for  generation  of  surface  struct  ures. 
Lexical  items  are  stored  in  the  dictionary  with  their  associated  properties,  such  as  mor¬ 
phological  feature  sets,  0-roles  that  are  assigned,  and  lexical-semantic  representations.  The 
second  component,  a  syntactic  and  morphological  synthesizer,  maps  a  base  form  (henceforth 
called  D-structure)  to  its  corresponding  surface  form  (henceforth  called  S-structure).  The 
dictionary  and  synthesizer  are  standard  components  of  any  generation  system;  however,  they 
dill'er  from  other  systems  in  that  they  are  based  on  LCS  structures  rather  than  rule-systems, 
semantic  networks,  or  discrimination  nets. 

I  he  final  component  of  the  generator  is  the  emphasis  of  the  project  discussed  here.  This 
is  t  lie  module  that  maps  the  lexical  conceptual  representation  of  a  sentence  to  the  D-structure 
of  the  target  language  sentence.  This  mapping  requires  both  lexical  replacement,  routines 
and  linking  rules  in  order  to  derive  predicate-atgument  structures  from  the  lexical-semantic 
representation.  For  example,  in  order  to  translate  a  source  language  verb  like  yustar  to  its 
target  language  equivalent  like ,  lexical-replacement  routines  must  match  the  LCS  structures 
of  these  two  verbs;  then  linking  rules  will  be  required  in  order  to  determine  the  structural 
positioning  of  the  arguments  (c.«/„  that  the  agent  is  externally  positioned  in  English,  not 
mli  molly  as  it  is  in  Spanish). 

Each  of  the  three  tasks  of  the  scheme  will  be  discussed  in  the  following  sections. 

3.1  Construction  of  a  Dictionary 

The  goals  of  this  portion  of  the  project  are  consistent  with  those  put  forth  by  the  lexicon 
project  in  the  Center  for  Cognitive  Science  at  MIT.  The  focus  is  on  representing  knowledge 
of  the  syntactic  and  semantic  properties  of  lexical  items,  particularly  of  verbs  and  their 
arguments. 

In  order  to  construct  a  dictionary,  it  is  necessary  to  identify  and  utilize  verbal  properties 
through  a  study  of  lexical  organization.  Typically,  lexical  entries  provide  a  minimal  specifi¬ 
cation  of  the  syntactic  expression  of  the  arguments  of  verbs.  Within  GB  theory,  there  has 
been  a  move  away  from  explicit  use  of  subcategorization  frames  since  the  syntactic  relations 
between  the  constituents  in  a  sentence  can  be  derived  by  two  requirements:  that  0-roles  be 


assigned  under  government  and  that  mums  lie  assigned  ease  to  he  well  formed. 

In  the  process  of  building  a  dictionary,  several  subtasks  are  relevant: 

1.  Identification  of  thematic  relations  (and  verification  of  their  existence). 

2.  Organization  of  thematic  relations  (such  as  AGE  VI  )  into  classes  according  to  the 
constraints  they  are  subject  to.  I  his  may  lead  to  the  construction  o|  a  thematic  hierarchy, 
if  such  a  t  hing  exists. 

3.  Construction  of  a  mapping  bet  ween  t  hematic  relations  and  syut  actic  arguments  (taking 
into  account  the  fact  that  the  mapping  may  not  l»e  one-to-one). 

4.  Refinement  of  verb  classes  through  examination  of  cooccurrence  restrictions. 

3.2  Mapping  of  D-Structure  to  S-Structure 

Two  components  are  required  to  map  the  I)  structure  to  its  corresponding  surface  form. 
The  first  is  a  syntactic  synthesizer,  essentially  a  movement  module,  that  displaces  tokens 
according  to  requirements  of  Case  Theory  (of  Government-Binding).  The  second  component 
is  a  morphological  synthesizer  that  converts  a  root  form  and  a  set  of  features  into  a  surface 
form. 

The  movement  module  accesses  certain  parameter  settings  corresponding  to  the  language 
to  be  generated.  These  parameter  settings  determine  the  type  of  movement  required.  For 
example,  the  ich- movement  parameter  setting  for  English  dictates  that  Subject  -  A  ux  Inversion 
(SAI)  is  to  be  triggered.  Consequently,  the  generator  will  perform  «’/>- movement  and  SAI  to 
produce  an  output  form.  By  contrast,  in  Spanish  the  wh- movement  parameter  is  set  such 
that  V -  Preposing  (not  SAI)  occurs. 

The  morphological  module  converts  root-Kf eature>  forms  into  surface  forms  ( »  .«/., 
"road+3S"  is  converted  to  "reads").  This  requires  two  mappings:  one  from  features  to 
possible  affixes  [<.</.,  "3S"  ■=•  "s"),  and  one  from  <root>+<aff  ix>  to  possible  surface  forms 
( e.g.,  "read+s"  "reads"). 

3.3  Mapping  of  Lexical  Conceptual  Structure  to  D-Structure 

In  order  to  map  the  lexical  conceptual  representations  that  comprise  a  sentence  to  the  target 
language  D-structure  of  the  sentence,  two  modules  are  needed:  (1)  a  lexical  replacement 
module  that  determines  the  corresponding  target  language  {e.g.,  English)  words  for  the 
LCS's  produced  by  parsing  the  source  language  ( e.g .,  Spanish)  sentence;  and  (2)  a  syntactic 
module  that  performs  the  necessary  operations  in  order  to  arrive  at  the  target  language 
D-structure  of  the  sentence.  Thus,  there  are  two  top-level  operations  during  the  mapping 
from  LCS  to  D-structure:  selection  of  target  language  words,  and  linking  of  surface-sentence 
words  to  their  corresponding  syntactic  position. 

The  input  to  this  component  of  the  generator  is  a  set  of  LCS’s  produced  by  a  parser  that 
maps  source  language  sentences  to  their  underlying  structures.  The  output  is  the  target 
language  deep  structure  representation  that  will  be  used  to  generate  the  surface-sentence  (by 
routines  discussed  in  the  last  section).  The  selection  of  lexical  translations  for  each  token  in 
a  given  underlying  form  begins  with  the  predicate.  The  dictionary  entry  corresponding  to 
the  predicate  is  accessed,  the  surface  verb  of  the  sentence  is  selected,  and  the  arguments  of 
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the  predicate  are  mapped  to  the  case  roles  of  the  verb.  Then  the  entries  for  each  argument 
are  accessed  to  return  the  lexical  translations  for  t  he  remainder  of  the  proposition. 

To  illustrate  this  process,  we  will  look  at  the  translation  of  the  word  pom  v  (  pul  in 
English).  Suppose  the  source  language  sentence  is  go  pongo  cl  I  Him  in  In  raja  (  /  pul  /In 

hoot,'  in  the  box).  First  the  lexical  entry  for  the  word  pom  r  is  accessed.  Itccall  that  lexical 
entries  contain  the  LOS  and  ^-marking  requirements: 


PONER:  (event  (posit  MOVE  (thing  X)  (thing  Y)  (place  Z))) 

X  =  agent,  Y  =  theme,  Z  =  locatum 

Next,  the  process  of  selection  matches  this  LOS  to  that,  of  the  English  verb  put  (repeated 
here  for  clarity): 

(event  (posit  MOVE  (thing  X)  (thing  Y)  (place  Z))) 

The  deep  structure  of  the  sentence  is  dependent  on  its  verb  (e.g.,  how  many  objects 
it  takes,  whether  it  has  a  subject,  etc.).  Once  a  verb  has  been  selected  to  translate  the 
predicate,  the  semantic  arguments  of  the  deep  structure  are  filled  with  the  instantiated 
arguments  of  the  predicate.  Thus,  after  put  lexically  replaces  poner ,  the  linking  process  is 
activated.  The  ^-marking  properties  of  put  combined  with  linking  rules  for  agent ,  theme. 
and  locatum  derive  the  following  predicate-argument  (deep)  structure: 

(2)  |,  Np  I;  |vp  put  [NP  the  book]  [PP  in  the  box]]] 

Here,  the  linking  rules  have  mapped  the  agent  (=  /)  into  external  argument  position,  and 
the  theme  (=  the  book)  and  locatum  (--  in  the  box)  into  internal  argument  position.  (See 
section  4.3  for  an  example  of  a  linking  rule.) 

4  Embodiment  of  Linguistic  Theory 

The  above  scheme  of  representation  and  generation  should  be  constructed  in  such  a  way 
that  properties  that  are  shared  among  all  languages  are  handled  by  a  unified  set  of  “core” 
linguistic  principles,  while  the  differences  among  languages  are  accounted  for  by  a  set  of 
possible  parameters  of  variation.  In  this  view,  many  properties  of  particular  languages  can 
be  accounted  for  through  the  interaction  of  principle-based  subsystems,  while  complexes  of 
properties  differentiating  otherwise  similar  languages  should  (ideally)  be  reducible  to  a  single 
parameter,  fixed  in  one  or  another  way.'  Thus,  in  order  to  build  a  generator  for  machine 
translation,  it  is  necessary  to  determine  both  the  lexical  properties  that  make  words  similar 
across  languages,  as  well  as  the  properties  that  distinguish  words  cross-linguistically.  In 
terms  of  the  generation  approach  discussed  here,  the  “core”  linguistic  principles  are  those 
procedures  required  for  selection  of  words  and  linking  of  LCS  to  syntactic  structure,  while 
the  parameterization  occurs  in  the  lexicon,  with  individual  lexical  items  taking  on  their  own 
language-particular  “meaning”  and  thematic  role-assigning  properties. 

‘  A  brief  overview  of  the  principles  of  tiB-lheory  is  presented  in  Dorr,  11187. 
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Recall  that  there  are  two  top-level  operations  lor  mapping  IT'S  to  I >  structure:  >./(<  - 
lion  ami  linkin').  Before  developing  procedures  lor  these  t  wo  operations,  it  is  necessary  to 
examine  some  examples  of  source- to-target  language  translations  and  to  determine  some  of 
the  difficulties  that  might  arise  during  generation  of  target  language  sentences.  ()|  partic 
ular  concern  are  the  problems  of  ■  hematic  divergence  (which  makes  selection  difficult  i  and 
structural  divergence  ( which  makes  linking  diliicult  |.  Some  examples  should  shed  some  light 
on  what,  is  needed  for  both  the  lexical  conceptual  representation  of  words  as  well  as  the 
mapping  from  this  representation  to  the  surface  form.  In  t  he  examples  shown  here.  Spanish 
and  English  are  the  two  languages  used.  Other  languages  (e.g.,  German  and  Japanese)  will 
also  be  tested  when  the  implementation  is  complete.  A  generation  scheme  based  on  LCS  will 
be  presented  as  a  solution  for  the  problems  exhibited  in  the  examples.  I  will  then  discuss 
the  goals  of  the  scheme. 

4.1  Example  1:  Thematic  Divergence  as  a  Problem  for  Lexical 
Selection 

The  task  of  lexical  selection  is  difficult  because  of  the  possibility  of  thematic  dice ryt  nee.  i.e.. 
a  difference  in  the  order  of  thematic  role  assignment.  An  example  of  thematic  divergence 
is  the  translation  of  the  Spanish  word  guslar  to  the  English  word  like.  Although  these  two 
verbs  are  semantically  equivalent,  their  argument  structures  are  not  identical:  the  subject 
of  guslar  is  the  patient  of  the  action,  whereas  the  subject  of  like  is  ihe  a>j>  nt  of  the  action. 
Thus,  we  have: 

(3)  Me  gusta  el  libro  a  mi 

(To  me  the  book  pleases  me) 

‘I  like  the  book’ 

In  general,  cases  such  as  (3)  are  not  problematic.  The  difference  in  order  of  thematic  role 
assignment  is  easily  manipulated  by  simple  procedures  that  check  thematic  requirements  of 
the  two  verbs.  Furthermore,  the  verb  guslar  can  have  the  translation  like  stored  directly 
in  its  lexical  entry  since  this  is  the  only  possible  translation  for  it.  However,  problems  arise 
when  a  verb  has  more  than  one  translation  depending  on  the  select ional  restrictions  of  its 
arguments.  Two  examples  are  the  English  words  slash  and  snuar : 

(4)  (i)  He  slashed  the  woman 

‘Dio  cuchilladas  a  la  mujer’ 

(ii)  He  slashed  the  paper 
‘Corto  el  papel1 

(5)  (i)  She  smeared  her  makeup 

‘Embarro  su  inaquillaje’ 

(ii)  She  smeared  the  wall  with  paint 
‘Pintarrajeo  a  la  pared’ 

In  (4)(i)  the  translation  of  slash  is  the  composite  form  dar  cuchilladas  a.  whereas  in  (4  )(ii) 
the  translation  of  slash  is  the  single  word  corlar.  In  f»)(i)  smear  is  translated  directly  to  the 
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Spanish  word  embarivr.  bill  in  (5)(ii)  smear  translates  l»  tin-  more  complex  Spanish  word 
pintarrajear  a  which  implicitly  incorporates  the  nominal  argument  /mini.  In  such  eases,  it 
is  not  possible  to  store  direct  translations  in  (lie  lexical  entries  of  tin-  verbs  since  argument 
incorporation  is  sometimes  required:  in  the  case  of  slush,  the  translation  breaks  down  into  a 
more  basic  argument  -st  met  arc  dor  einhillnihi.-  n  ( literally  1  ran  slated,  it  means  In  yin  liuji  - 
iroimils  to);  and  in  the  case  o|  sonar  the  translation  combines  the  argument  paint  with  the 
verb  in  order  to  arrive  at  pintarra year  a.  Thus,  we  see  a  need  lor  word  definition  in  terms 
of  more  basic  meaning  structures  in  order  to  choose  an  accurate  translation  at  generation 
time. 

4.2  Example  2:  Structural  Divergence  as  a  Problem  for  Linking 

Linking  a  meaning  structure  to  its  surface-syntactic  representation  is  difficult  because  of 
cases  of  structural  divergence  between  languages.  In  general,  in  these  cases,  there  is  also  a 
selection  problem  (in  fact,  the  choice  of  an  equivalent  target  language  verb  may  lead  to  a 
non-equivalent  surface-structure  representation).  An  example  of  structural  divergence  is  the 
translation  of  the  Spanish  verb  lenrr  as  the  English  verb  be  in  certain  cases: 

(ti)  Tengo  calor 
( I  have  heat ) 

“I  am  hot* 

The  predicate-argument  structure  for  U  in  r  ealnr  i>: 

(7)  \VP  tener  [NP  calor)] 

The  predicate-argument  structure  for  be  hoi  is: 


Here,  a  noun-phrase  argument  must,  be  changed  into  its  adjectival- phrase  counterpart. 

There  are  also  structural  divergences  in  which  adjuncts  and  arguments  are  either  added 
or  deleted  in  the  resulting  translation.  In  general,  if  a  token  is  added,  that  token  was  implicit 
in  the  original  source  language  verb:  if  a  token  is  deleted,  that  token  becomes  incorporated 
into  the  target  language  verb.  An  example  in  which  a  token  is  implicit  in  the  source  language 
verb  is  the  composite  verb  throw  down;  the  translation  is  cchar  por  tierra  (literally,  throw 
to  the  ground): 

(!))  He  threw  down  the  book 
‘Echo  por  tierra  el  libro’ 

Whereas  throw  down  is  syntactically  a  single  unit,  relta r  por  tirriv  consists  of  a  verb  with  a 
prepositional  adjunct: 

(10)  [vp  [v  throw-away]j 

[vp  [vp  [v  echar]]  |PP  por  tierra] j 

On  the  other  hand,  the  composite  verb  throw  away  is  simply  translated  as  Hear,  which 
has  the  token  away  incorporated  directly: 


'  i  *  •>  #  ,  * 


(II)  Ho  threw  away  the  hook 
•  Tiro  el  libro’ 

Thus,  the  two  syntaetie  struelures  are  ossont iall y  the  same: 

(  12)  jvr  v  throw -down!! 

j vi»  iv  tirarj: 

Similarly,  the  Spanish  verb  forzar  may  have  the  translation  break  into  (as  in  (13)(i)) 
if  the  token  la  entrada  is  present  (the  literal  translation  is  foire  tin  entry),  or  it  may  he 
translated  simply  as  force  (as  in  ( L 3 )( ii ) ): 

( 13)  (i)  Forzo  la  entrada  a  la  casa 

‘He  broke  into  the  house’  ^ 

(ii)  Forzo  el  ejercito  rendir 

‘He  forced  the  army  to  surrender’ 

The  corresponding  divergent  and  equivalent  structures  for  these  examples  are: 

(14)  (i)  [Vp  [v  forzarj  [NP  la  entrada)] 

[VP  [v  break]  [PP  into  ■  •  •]] 

(ii)  [Vp  (v  lorzarj  [NP  el  ejercito;] 

\  break]  ;Np  the  army] 

4.3  Lexical  Conceptual  Structure 

The  translation  examples  above  provide  strong  evidence  that  a  suitable  representation  for 
lexical  conceptual  structure  is  needed.  Previously,  generation  systems  used  discrimination 
nets  in  order  to  select  the  appropriate  surface  forms  for  underlying  concepts.  For  example, 
in  (’arbonell,  el.  al.  (1981)  the  sentence  Mary  hit  John  is  represented  as: 

(event  EV001 

(action  PROPEL) 

(agent  NARY) 

(object  JOHN) 

(instrument  *UNKN0WN*) 

(force  *AB0VE- AVERAGE*) 

( intentional it y  *P0SITIVE*)) 

In  order  to  translate  the  above  concept  into  Spanish,  the  main  action  (PROPEL)  is  mapped 
to  the  discrimination  network  shown  in  figure  2.  This  network  is  then  used  to  choose  the 
correct  verb.  As  a  series  of  If-Then  statements,  this  net  expands  into  the  complex  block  of 
code  shown  in  figure  3. 

In  the  above  scheme,  there  is  no  representation  of  the  “meaning’'  of  the  verbs  being 
generated;  rather,  the  mapping  from  concept  to  surface  form  is  performed  by  means  of  ad 
hoc  inferencing  procedures  that  test  selectional  restrictions  of  arguments  and  act  accordingly. 
The  problem  with  such  an  approach  is  that  the  net  work  can  grow  very  large  as  more  verbs  are 
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Figure  2:  Discrimination  Network  for  PROPEL,  from  C'arbonell,  rt.  al.  (l!>81) 

added,  and  the  time  it  takes  to  search  lor  an  appropriate  verb  is  exponential.  Additionally, 
this  same  network  must  be  searched  ret  ry  time  a  verb  expressing  PROPEL  conceptualizations 
is  to  be  generated;  thus,  a  great  deal  of  time  is  wasted  testing  for  irrelevant  selectional 
restrictions.  For  example,  even  though  no  selectional  restrictions  apply  on  the  arguments  of 
“golpear'2”,  the  entire  network  must  still  be  searched  in  order  to  generate  this  surface  form. 

There  are  additional  problems  with  this  network  approach.  First,  the  network  structure 
does  not  readily  accommodate  adding  new  words  or  deleting  old  ones  since  major  surgery 
of  the  network  is  typically  required.  Second,  generation  of  compound  predicate-argument 
structures  is  not  accommodated  by  such  a  scheme  since  all  the  paths  lead  to  a  single  lexical 
item  (not  a  set  of  lexical  items);  thus,  neilher  thematic  nor  structural  divergence  is  explained 
in  such  a  scheme.  Finally,  the  scheme  is  not  general  enough  to  apply  across  several  languages 
since  surface  forms  are  not  represented  as  basic  units  of  meaning;  consequently,  a  new  network 
must  be  hand-written  for  each  language  to  be  generated. 


Ill 


If  ACTOR  is  Human 
Then  If  ACTION  is  intentional 

Then  If  OBJECT  is  animate 

Then  If  INSTRUMENT  is  used 

Then  If  INSTRUMENT  is  part  of  body  of  ACTOR 
Then  If  foot 

Then  "pat ear” 

Else  “pogar2“ 

Else  If  sharp 

Then  “apunalar" 

Else  “go I pear  I  " 

Else  “pegarl” 

Else  If  INSTRUMENT  is  used 
Then  "golpearl" 

Else  "empujar" 

Else  If  size(  OBJECT)  size(ACTOR) 

Then  “aplastar" 

Else  “chocar” 

Else  If  ACTOR  is  self-propelled 

Then  If  size( OBJECT)  size(  ACTOR) 

Then  “aplastar" 

Else  “chocar" 

Else  If  ACTOR  is  gravity 

Then  If  OBJECT  is  ground 
Then  “terremoto" 

Else  “caer” 

Then  If  force  is  very  small 
Then  “rozar" 

Else  “golpear’2" 

Figure  3:  If-Then  Code  for  PROPEL  Discrimination  Network 
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In  order  to  model  cross -linguist ic  variations  in  predicate-argument  structures  such  as 
these,  an  adequate  lexical-semantic  representation  is  required.  According  to  Talmy  ( 1  ?)S5 ) 
(following  Jackendoff  and  Ci'ruber).  verbs  should  he  defined  in  terms  of  many  semantic  com¬ 
ponents  that  contribute  to  the  overall  meaning.  Thus,  verbs  may  have  a  semantic  repre¬ 
sent  at  ion  that  is  not  entirely  exhibited  at  the  level  of  syntactic  structure.  For  example,  the 
verb  inter  incorporates  an  “understood"  particle  into  as  part  of  its  meaning  structure;  this 
particle  manifests  itself  in  the  equivalent  composite  predicate  go  into.  This  incorporation  or 
conflation  of  properties  is  where  cross-linguistic  parametric  variations  are  revealed.  For  ex¬ 
ample,  where  English  conflates  manner  and  motion  in  the  boat  floated  on  the  water ;  Spanish 
disallows  this  conflation,  requiring  a  syntactic  realization  for  each  semantic  component:  ta 
barca  se  mudaba  flotando  en  el  oyua  (literally,  this  is  the  boat  moved  floating  on  the  water). 

Using  a  representation  similar  to  that  of  Jackendoff  ( 1972,  1983),  the  semantic  equivalence 
between  enter  and  go  into  is  easily  modeled: 

enter  =  (event  (posit  GO  (thing  X)  (path  TO  (place  IN  (thing  Z))))) 
go  =  (event  (posit  GO  (thing  X)  (path  Y))) 
into  =  (path  TO  (place  IN  (thing  Z))) 

Here,  the  LC’S  forms  for  go  and  into  can  be  composed  into  the  more  complex  LC’S  form  for 
enter. 

Similarly,  the  LC’S  for  float  can  be  decomposed  into  the  LCS  forms  for  move  and  float: 

float  =  (event  (posit  GO  (thing  X)  (property  BUOYANT)  (path  Z))) 
move  =  (event  (posit  GO  (thing  X)  (property  Y)  (path  Z))) 
floating  *  (property  BUOYANT) 

Note  that  this  approach  differs  from  that,  of  C’arbonell,  et.  at.  (1981)  in  that  the  “mean¬ 
ing”  structure  ( i.e.,  LCS)  is  stored  directly  in  the  lexical  entry  of  each  word;  it  is  not  derived 
by  network  navigation.  The  primit  ives  of  the  scheme  described  here  are  used  compositionally 
to  define  words  of  the  source  and  target  languages.  Because  source  language  definitions  are 
matched  against  target  language  definitions  to  select  the  correct  target  language  words,  there 
is  no  need  to  test  properties  of  arguments;  thus,  time-consuming  and  unnecessary  searches 
are  avoided.  Furthermore,  through  a  combination  of  a  small  set  of  linking  rules  and  a  list 
of  0-role  assigning  properties,  the  LC’S  scheme  provides  a  facility  for  syntactic  realization 
of  surface  forms.  In  contrast,  it  is  not  dear  how  syntactic  structure  is  realized  using  the 
discrimination  network  approach. 

With  respect,  to  generation  in  the  context  of  machine  translation,  this  decomposition  of 
meaning  is  useful  in  the  mapping  from  underlying  LC’S  forms  to  target  language  surface 
forms.  In  dealing  with  thematic  divergence,  LC’S’s  provide  a  uniform  representation  for 
equivalent  source-target  pairs.  Thus,  both  gustar  and  like  have  the  following  LCS: 

(event  CAUSE  (thing  X)  (poss  BE  (thing  Y)  (property  PLEASED))) 

The  difference  in  thematic  role  assignment  can  then  be  determined  by  means  of  properties 
of  the  individual  lexical  items.  The  agent  0-rolo  will  be  assigned  to  X  in  the  case  of  gustar 
and  to  Y  in  the  case  of  like. 
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The  LCS  ?,«-me  is  also  handy  in  t  he  ease  where  a  verb  may  t  ranslate  into  more  t  han  one 
surface  form  depending  on  its  arguments.  We  saw  that,  smear  translates  either  directly  as 
embarmr  or  as  the  conflated  verb  pintarrajear  (if  the  object  that  is  being  smeared  is  paint). 
The  LOS  for  smear  and  emharrar  is:x 

(event  (poss  MOVE  (thing  X  (property  FLUID)) 

(path  ALONG  (place  (poss  ON)  (thing  Y))))) 

The  representation  for  the  noun  paint  is: 

(thing  PAINT  (property  FLUID)) 

and  the  representation  for  pintarrajear  is: 

(event  (poss  MOVE  (thing  PAINT  (property  FLUID)) 

(path  ALONG  (place  (poss  ON)  (thing  Y))))) 

Thus,  smear  paint  will  be  translated  as  pintarrajear  since  the  LCS  of  the  noun  paint  matches 
the  object  of  MOVE  in  the  LCS  for  pintarrajear.  On  the  other  hand,  smear  makeup  will  be 
translated  as  embarrar  maquillajc  since  the  word  maquillaje  does  not  match  the  object  of 
MOVE  in  the  LCS  for  pintarrajear. 

The  LCS  scheme  also  provides  an  adequate  model  of  structural  divergence  in  the  link¬ 
ing  bf  meaning  structure  to  its  surface-syntactic  representation.  Recall  that  tencr  calor  is 
translated  to  the  structurally  divergent  form  be  hot.  The  LCS  for  have  is: 

(state  (poss  BE  (thing  X)  (place  (poss  AT)  (thing  Y)))) 

The  LCS  for  be  is: 

(state  (ident  BE  (thing  Y)  (property  X))) 

In  the  case  of  tener  rotor,  the  first  LCS  is  instantiated;  thus,  X  is  set  to  be  color  and  Y 
is  set  to  be  the  agent  of  the  predicate.  (The  assignment  of  agent  to  Y  is  specified  as  a 
language- particular  property  of  the  verb  tener  in  the  lexicon.)  This  LCS  is  then  mapped  to 
the  LCS  for  be,  where  Y  is  the  agent,  and  X  is  converted  into  the  property  hot  corresponding 
to  calor  ( i.e .,  the  nominal  form  heat  is  changed  into  the  adjectival  form  hot).  Note  that  the 
difference  between  the  source  and  target  structure  is  determined  solely  on  the  basis  of  the 
identification  of  X  as  a  property  rather  than  a  thing  in  the  LCS  for  the  target  language  verb. 

Structural  divergence  due  to  conflation  is  also  modeled  by  the  LCS  scheme.  As  we  have 
seen,  the  composite  verb  break  into  is  translated  as  forzar  la  entrada.  The  LCS  for  break 
into  is: 

“The  representation  shown  here  is  primarily  based  on  Jackendoff's  conceptual  structures;  however,  dis¬ 
cussion  with  Michael  Brent  influenced  me  to  add  the  property  FLUID.  A  more  elaborate  LCS  form  for  verbs 
such  as  throw ,  smear,  and  spray  are  in  Brent  (1988);  however,  the  simple  representations  shown  here  are 
adequate  for  the  purposes  of  this  discussion. 


(event  GO  (thing  X)  (path  TO  (place  (poss  IN)  (thing  Y))) 

(property  VIOLENTLY)) 

The  LOS  for  forzar  is: 

(event  GO  (thing  X)  (path  Y)  (property  VIOLENTLY)) 
and  the  LOS  for  entrada  is: 


(path  TO  (place  (poss  IN)  (thing  Y))) 

Thus,  in  linking  the  compound  LOS  for  break  into  to  the  target  language  syntactic  form,  the 
compound  LOS  must  be  decomposed  into  the  individual  LCS’s  for  forzar  and  entrada',  these 
decomposed  structures  are  then  linked  to  the  surface-syntactic  representation  for  forzar  la 
entrada. 


4.4  Goals  of  the  Generation  Scheme 

If  the  system  is  to  handle  the  examples  mentioned  above,  it  should  embody  modern  lin¬ 
guistic  theory  so  that  it  provides  an  explanatory  model  of  language  generation.  In  order  to 
be  explanatorily  adequate,  the  system  must  base  its  operation  on  general  procedures  that 
adhere  to  well-defined  linguistic  principles.  Furthermore,  the  system  must  include  several 
parameters  of  variation  so  that  it  is  flexible  enough  to  handle  several  languages.  This  param¬ 
eterization  also  fulfills  the  goal  of  extendability;  adding  new  languages  reduces  to  changing 
parameter  values  of  the  system. 

An  additional  goal  is  that  of  expressive  power.  The  primitives  that  are  the  basis  of  the 
system  should  be  designed  with  cross-linguistic  applicability  in  mind.  In  order  to  parame¬ 
terize  the  system,  the  primitives  must  be  adequate  for  composition  into  complex  meaning 
structures  that  map  into  the  words  of  both  the  source  and  the  target  language. 

The  goal  of  avoiding  ad  hoc  rules  can  be  fulfilled  if  the  scheme  makes  use  of  a  more 
restrictive  theory  of  lexical  semantics  than  that  of  existing  generation  systems.  Furthermore, 
the  semantic  structures  should  be  designed  so  that  general  inference  will  not  be  required  in 
order  to  select  target  language  words  in  the  generation  process.  As  long  as  the  mapping  from 
LC’S  to  surface  form  is  uniform  across  all  LC’S  forms,  general  inferencing  procedures  will  not 
be  required.  The  operations  of  lexical  selection  and  syntactic  realization  are  simplified  once 
rules  and  general  inferencing  are  eliminated:  LCS  and  0-role  mappings  obviate  the  need  for 
complicated  network  searches  and  rule  applications.  Finally,  exponential  search  or  varying 
search  time  for  different  words  can  be  avoided  if  there  are  no  general  inference  procedures. 

An  example  of  how  the  LCS-based  translation  process  will  operate  at  each  stage  is  the 
following: 

Source  Language  Sentence: 

El  libro  me  gusta  a  mi 

Source  Language  Parse: 

[s  [NP  el  libroj  (VP  me  gusta  [NP  a  mi]]] 

Instantiation  of  LCS  (Spanish): 

(event  CAUSE  (thing  libro  (poss  BF,  (thing  <  mi  >„,i)  (property  PLEASED))) 
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)  (properly  I'l. EASED))) 


Instantiation  of  LCS  (English): 

(event  CAUSE  (thing  hook  (poss  HE  (thing  I 

Target  Language  Generation: 

[..  |N|.  !|  (Vi-  1  ik«*  |NI.  tl»«'  hook  ! 

Target  Language  Sentence: 

1  like  the  hook. 


5  Work  To  Be  Done 

The  generator  will  be  an  “inverse  UNITRAN  parser;”  it  will  replace  the  generation  compo¬ 
nent  that  is  currently  part  of  the  UNITRAN  machine  translation  system.  In  order  to  build 
the  LCS-based  generator,  several  tasks  must  he  undertaken.  First .  tin*  selection  of  primitives 
is  necessary.  All  of  the  LOS  forms  are  based  on  cross-linguistically  applicable  primitives  (like 
GO  and  BE)  that  must  be  carefully  defined.  The  primitives  must  he  designed  so  that  they 
are  easily  programmable,  but  they  are  not  decomposable  (in  any  language). 

The  next  task  is  the  construction  of  the  LCS  forms.  This  means  that  the  primitives  must 
be  composed  in  a  certain  manner  in  order  to  arrive  at  certain  meaning  structures.  Section  4.11 
gives  some  examples  of  how  the  primitives  (like  GO,  BE,  tic.)  are  composed  to  form  words 
with  complex  meanings  structure  (like  enter  and  go  into). 

An  additional  task  is  to  provide  a  mapping  from  LOS  to  surface  structure.  This  includes 
routines  for  both  selection  and  syntactic  realization.  Principles  that  are  already  built  into 
the  UNITRAN  system  will  be  operative  during  this  mapping  (as  they  are  during  parsing); 
however,  thematic  role  assignment  will  have  to  be  extended  to  include  assignment  to  variables 
in  LCS. 

In  addition  to  the  actual  construction  of  the  system,  methods  of  testing  and  evaluating 
the  system  need  to  be  devised.  In  particular,  cross-linguistic  generalization  will  need  to  be 
tested.  This  can  be  done  by  trying  the  system  on  other  languages.  In  addition  to  English  and 
Spanish,  the  two  languages  that  will  be  tested  are  German  and  Japanese.  It  must  be  possible 
to  perform  lexical  selection  on  the  basis  of  LCS  structures  for  all  four  of  these  languages; 
furthermore,  syntactic  realization  must  work  correctly  for  each  language.  In  order  lor  this 
endeavor  to  be  realized,  parameters  of  variation  must  he  established.  On  the  syntactic  side, 
the  UNITRAN  system  is  already  parameterized  according  to  GB  theory.  On  the  lexical- 
semantic  side,  parameterization  occurs  in  the  lexicon  and  in  the  linking  rules.  Once  the 
settings  are  established  for  the  languages  handled  by  the  system,  an  evaluation  can  be  made 
on  the  basis  of  the  correctness  of  translated  sentences. 

6  Difficulties  to  be  Addressed 

The  first  consideration  in  building  the  generator  is  that  it  must  be  constructed  so  that 
it  is  based  on  the  same  principles  that  the  parser  uses.  The  principles  that,  are  already 
part  of  UNITRAN  are  primarily  syntactic  in  nature;  thus,  they  will  not  affect  the  lexical 
conceptual  structure,  but  they  will  affect  how  the  syntactic  portion  of  the  generator  operates. 
For  example,  during  the  structural  realization  process  (or  linking ),  the  satisfaction  of  cert  ain 
syntactic  constraints  must  still  be  maintained  ( e.g.,  that  a  verb  governs  its  object  in  English ). 


Another  difficulty  is  the  construction  of  primitives.  It  is  not  clear  how  many  primitives 
to  have,  nor  is  it  easy  to  determine  that  the  primitives  are  indeed  non-riecomposable  in 
every  language.  Furthermore,  the  primitives  must  he  easy  to  represent  and  to  compose  into 
complex  meaning  structures. 

The  process  of  lexical  selection  might  also  he  problematic  in  that  it  is  not  always  easy 
to  determine  how  far  an  LCS  shottld  he  broken  down  before  generating  a  surface  form.  For 
example,  recall  that  the  LCS  for  smear  paint  is: 

(event  (poss  MOVE  (thing  PAINT  (property  FLUID)) 

(path  ALONG  (place  (poss  ON)  (thing  Y))))) 

In  Spanish,  this  can  either  be  broken  down  into  two  non-composite  surface  forms  embarrar 
pinitira  (literally,  smear  paint),  or  it  can  be  left  as  the  composite  surface  form  pintarrajear. 
In  order  to  solve  this  problem,  a  principle  of  conservation  will  be  needed:  the  most  complex 
set  of  words  that  matches  an  LCS  will  be  chosen  for  generating  a  surface  form.  In  the  above 
example,  pintarrajear  is  chosen. 

Another  difficulty  to  be  addressed  is  that  thematic  role  assignment  will  need  to  be  modi¬ 
fied  to  apply  to  instantiated  LCS  arguments,  but  it  still  must  remain  consistent  with  syntactic 
principles  that  are  already  part  of  the  system  (e.g.,  the  0-Criterion).  Thus,  while  thematic 
roles  are  used  in  the  mapping  from  the  LCS  to  the  syntactic  structure,  they  must  still  be 
preserved  after  the  syntactic  structure  is  derived  in  order  to  satisfy  syntactic  principles  that 
already  exist. 

A  final  difficulty  to  be  addressed  is  that  of  final  realization  of  the  source  language  surface 
structure.  Once  the  appropriate  LOS  has  been  chosen,  the  correct  surface  forms  have  been 
selected,  and  linking  has  taken  place  to  derive  a  syntactic  structure,  the  generator  must 
perform  certain  movement  operations  in  order  to  arrive  at  the  final  surface  structure.  For 
example,  the  V-Preposing  operation  in  Spanish  fronts  a  verb  when  a  irA-question  is  asked: 

(15)  i.Que  vio  Juan? 

(What  saw  John?) 

‘What,  did  John  see’ 

In  order  to  generate  the  V-preposo  form,  a  movement  parameter  must  be  accessed.  This 
parameter  is  set  to  V-prepose  in  Spanish  (and  SAI  in  English);  thus,  V-preposing  will  occur 
in  Spanish  (and  SAI  in. English)  when  a  wh- phrase  is  found  in  the  correct  position. 

Despite  these  difficulties,  once  the  generator  design  is  chosen,  it  should  be  possible  to 
make  headway  toward  reducing  the  amount  of  information  and  time  required  for  machine 
translation.  Ideally,  the  system  should  contain  a  small  and  tightly  constrained  set  of  param¬ 
eterized  principles. 
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